# Efficient 4-bit Quantization
## GLM 4 9B 0414 4bit DWQ
A high-performance 4-bit DWQ-quantized build of GLM-4-9B-0414, optimized for Apple silicon and supporting a 128K-token context window.

- License: Apache-2.0
- Category: Large Language Model
- Author: Narutoouz
## Qwen3 8b 192k Context 6X Josiefied Uncensored MLX AWQ 4bit
A 4-bit AWQ-quantized version of Qwen3-8B optimized for the MLX framework; it supports a 192K-token context window and is suited to on-device deployment.

- License: Apache-2.0
- Category: Large Language Model
- Author: Goraint
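For readers who want to produce a basic 4-bit MLX model themselves, the sketch below uses mlx-lm's `convert` utility. Note that this applies mlx-lm's standard weight quantization; the AWQ and DWQ builds listed above additionally rely on calibration recipes beyond this plain conversion. The output path is an assumption.

```python
# Minimal sketch: converting a Hugging Face model to a 4-bit MLX checkpoint.
from mlx_lm import convert

convert(
    "Qwen/Qwen3-8B",            # source Hugging Face model
    mlx_path="qwen3-8b-4bit",   # output directory (hypothetical)
    quantize=True,              # enable weight quantization
    q_bits=4,                   # 4-bit weights
    q_group_size=64,            # group size for quantization scales
)
```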